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Description 

BACKGROUND OF THE INVENTION 
5 1 . Field of the Invention 

[0001 ] The present invention relates to novel polynucleotides derived from microorganisms belonging to corynef orm 
bacteria and fragments thereof, polypeptides encoded by the polynucleotides and fragments thereof, polynucleotide 
arrays comprising the polynucleotides and fragments thereof, computer readable recording media in which the nucle- 
10 otide sequences of the polynucleotide and fragments thereof have been recorded, and use of them as well as a method 
of using the polynucleotide and/or polypeptide sequence information to make comparisons. 

2. Brief Description of the Background Art 

15 [0002] Corynef orm bacteria are used in producing various useful substances, such as amino acids, nucleic acids, 
vitamins, saccharides (for example, ribulose), organic acids (for example, pyruvic acid), and analogues of the above- 
described substances (for example, N-acetylamino acids) and are very useful microorganisms industrially. Many mu- 
tants thereof are known. >° 
[0003] For example, Corynebacterium glutamicum is a Gram-positive bacterium identified as a glutamic acid-pro- 

20 ducing bacterium, and many amino acids are produced by mutants thereof. For example, 1 ,000,000 ton/year of L- Q 
glutamic acid which is useful as a seasoning for umami (delicious taste) , 250,000 ton/year of L-lysine which is a valuable ^ 
additive for livestock feeds and the like, and several hundred ton/year or more of other amino acids, such as L-arginine, 
L-proline, L-glutamine, L-tryptophan, and the like, have been produced in the world {Nikkei Bio Yearbook 99, published Lli 
by Nikkei BP (1998)). 

25 [0004] The production of amino acids by Corynebacterium glutamicum is mainly carried out by its mutants (metabolic 
mutants) which have a mutated metabolic pathway and regulatory systems. In general, an organism is provided with 
various metabolic regulatory systems so as not to produce more amino acids than it needs. In the biosynthesis of L- 
lysine, for example, a microorganism belonging to the genus Corynebacterium is under such regulation as preventing 
the excessive production by concerted inhibition by lysine and threonine against the activity of a biosynthesis enzyme ^ 

30 common to lysine, threonine and methionine, i.e., an aspartokinase, (J. Biochem., 65: 849-859 (1969)). The biosyn- 
thesis of arginine is controlled by repressing the expression of its biosynthesis gene by arginine so as not to biosyn- J— 
thesize an excessive amount of arginine {Microbiology, 142. 99-108 (1996)). It is considered that these metabolic CO 
regulatory mechanisms are deregulated in amino acid-producing mutants. Similarly, the metabolic regulation is dereg- QJ 
ulated in mutants producing nucleic acids, vitamins, saccharides, organic acids and analogues of the above-described 00 

35 substances so as to improve the productivity of the objective product. 

[0005] However, accumulation of basic genetic, biochemical and molecular biological data on coryneform bacteria 
is insufficient in comparison with Escherichia coli, Bacillus subtilis, and the like. Also, few findings have been obtained 
on mutated genes in amino acid-producing mutants. Thus, there are various mechanisms, which are still unknown, of 
regulating the growth and metabolism of these microorganisms. 

40 [0006] A chromosomal physical map of Corynebacterium glutamicum ATCC 13032 is reported and it is known that 
its genome size is about 3,1 00 kb {Mol. Gen. Genet, 252. 255-265 (1 996)). Calculating on the basis of the usual gene 
density of bacteria, it is presumed that about 3,000 genes are present in this genome of about 3, 1 00 kb. However, only 
about 100 genes mainly concerning amino acid biosynthesis genes are known in Corynebacterium glutamicum, and 
the nucleotide sequences of most genes have not been clarified hitherto. 

45 [0007] In recent years, the full nucleotide sequence of the genomes of several microorganisms, such as Escherichia 
coli, Mycobacterium tuberculosis, yeast, and the like, have been determined {Science, 277: 1453-62 (1997); Nature, 
393: 537-544 (1998); Nature, 387: 5-1 05 (1997)). Based on the thus determined full nucleotide sequences, assumption 
of gene regions and prediction of their function by comparison with the nucleotide sequences of known genes have 
been carried out. Thus, the functions of a great number of genes have been presumed, without genetic, biochemical 

so or molecular biological experiments. 

[0008] In recent years, moreover, techniques for monitoring expression levels of a great number of genes simulta- 
neously or detecting mutations, using DNA chips, DNA arrays or the like in which a partial nucleic acid fragment of a 
gene or a partial nucleic acid fragment in genomic DNA other than a gene is fixed to a solid support, have been 
developed. The techniques contribute to the analysis of microorganisms, such as yeasts, Mycobacterium tuberculosis, 

55 Mycobacterium bovis used in BCG vaccines, and the like {Science, 278: 680-686 (1997); Proc. Natl. Acad. Sd. USA, 
96: 12833-38 (1999); Science, 284: 1520-23 (1999)). 
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(ii) a data storing device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target amino acid sequence information for determining a function of a polypeptide 
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least 
one amino acid sequence selected from SEQ ID NOS:3502 to 7001 ; and 

(iv) an output device that shows a function obtained by the comparator. 

30. A method based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , function 
information based on the amino acid sequence, and target amino acid sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target amino acid sequence information; and 

(iv) determining a function of a polypeptide having the target amino acid sequence which is coincident with or 
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to 

O 

31 . The system according to any one of claims 23, 25, 27 and 29, wherein a coryneform bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. _j 

00 

32. The method according to any one of claims 24, 26, 28 and 30, wherein a coryneform bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. „J 

<r 

33. The system according to claim 31 , wherein the microorganism belonging to the genus Corynebacterium is selected ^ 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophiium t Corynebacterium ^ 
acetogtutamicum, Corynebacterium callunae, Corynebacterium hercufis, Corynebacterium titium, Corynebacteri- 
urn meiassecola, Corynebacterium thermoamtnogenes, and Corynebacterium ammoniagenes. ^ 

34. The method according to claim 32, wherein the microorganism belonging to the genus Corynebacterium is selected ££J 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium callunae, Corynebacterium hercuiis, Corynebacterium lifium, Corynebacteri- 
um meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

35. A recording medium or storage device which is readable by a computer in which at least one nucleotide sequence 
information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide sequence is 
recorded, and is usable in the system of claim 23 or 27 or the method of claim 24 or 28. 

40 36. A recording medium or storage device which is readable by a computer in which at least one amino acid sequence 
information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid sequence 
is recorded, and is usable in the system of claim 25 or 29 or the method of claim 26 or 30. 

37. The recording medium or storage device according to claim 35 or 36, which is a computer readable recording 
45 medium selected from the group consisting of a floppy disc, a hard disc, a magnetic tape, a random access memory 

(RAM), a read only memory (ROM), a magneto-optic disc (MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM 
and DVD-RW. 



38. A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the Val 
so residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform bacterium 

is replaced with an amino acid residue other than a Val residue. 

39. A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino acid 
sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

55 

40. The polypeptide according to claim 38 or 39, wherein the Val residue at the 59th position is replaced with an Ala 
residue. 
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(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif information. 

25. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
s form bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

10 (jji) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 

3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage 
device for screening and analyzing amino acid sequence information which is coincident with or analogous to 
the target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

> 

26. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryne- Q_ 
form bacterium, comprising the following: Q 

o 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , and target 

20 sequence information or target structure motif information into a user input device; LU 

(ii) at least temporarily storing said information; rsr 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target sequence or target structure motif information; and 

(iv) screening and analyzing amino acid sequence information which is coincident with or analogous to the 

25 target sequence or target structure motif information. ^ 

27. A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having a 

target nucleotide sequence derived from a coryneform bacterium, comprising the following: J— 

C/5 

30 (i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:2 LU 

to 3501, function information of a polypeptide encoded by the nucleotide sequence, and target nucleotide OQ 
sequence information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
35 2 to 3501 with the target nucleotide sequence information for determining a function of a polypeptide encoded 

by a polynucleotide having the target nucleotide sequence which is coincident with or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501 ; and 

(iv) an output devices that shows a function obtained by the comparator. 

40 28. A method based on a computer for determining a function of a polypeptide encoded by a polypeptide encoded by 
a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprising the fol- 
lowing: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501, function in- 
45 formation of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence Information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 with 
the target nucleotide sequence information; and 

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence 
so which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected 

from SEQ ID NOS:2 to 3501 . 

29. A system based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

55 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function information based on the amino acid sequence, and target amino acid sequence infor- 
mation; 
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